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EXTREMAL CUTS OF SPARSE RANDOM GRAPHS 


AMIR DEMBO*, ANDREA MONTANARI f , AND SUBHABRATA SEN 


Abstract. For Erdos-Renyi random graphs with average degree 7 , and uniformly random 7 -regular graph 
on n vertices, we prove that with high probability the size of both the Max-Cut and maximum bisection 
are n(j + P*\/l + 0 (^ 7 )) + o(n) while the size of the minimum bisection is n(^ — P, + 0 (^ 7 )) + o(n). 
Our derivation relates the free energy of the anti-ferromagnetic Ising model on such graphs to that of 
the Sherrington-Kirkpatrick model, with P* ss 0.7632 standing for the ground state energy of the latter, 
expressed analytically via Parisi’s formula. 


1. Introduction 

Given a graph G = (V,E), a bisection of G is a partition of its vertex set V = V\ U V% such that the 
two parts have the same cardinality (if \V\ is even) or differ by one vertex (if |U| is odd). The cut size of 
any partition is defined as the number of edges (i, j) € E such that i € Ui, and j € V2. The minimum 
(maximum) bisection of G is defined as the bisection with the smallest (largest) size and we will denote this 
size by mcut(G) (respectively MCUT(G)). The related Max-Cut problem seeks to partition the vertices into 
two parts such that the cut size is maximized. We will denote the size of the Max-Cut by MaxCut(G). The 
study of these features is fundamental in combinatorics and theoretical computer science. These properties 
are also critical for a number of practical applications. For example, minimum bisection is relevant for 
a number of graph layout and embedding problems [DPS02]. For practical applications of Max-Cut, see 
[PT95]. On the other hand, it is hard to even approximate these quantities in polynomial time (see, for 
instance [Has97, HZ02, FK00, Kho04]). 

The average case analysis of these features is also of considerable interest. For example, the study of 
random graph bisections is motivated by the desire to justify and understand various graph partitioning 
heuristics. Problem instances are usually chosen from the Erdos-Renyi and uniformly random regular 
graph ensembles. We recall that an Erdos-Renyi random graph G(n, m) on n vertices with m edges is a 
graph formed by choosing m edges uniformly at random among all the possible edges. A 7-regular random 
graph on n-vertices G Reg (n,7) is a graph drawn uniformly from the set of all graphs on n-vertices where 
every vertex has degree 7 (provided yn is even). See [BolOl, JLR00, Hof] for detailed analyses of these 
graph ensembles. 

Both min bisection and Max-Cut undergo phase transitions on the Erdos-Renyi graph G(n, [yn]). For 
7 < log 2, the largest component has less than n/2 vertices and minimum bisection is 0(1) asymptotically 
as n A 00 while above this threshold, the largest component has size greater than n/2 and min bisection 
is O(n) [LMOO]. Similarly, Max-Cut exhibits a phase transition at 7 = 1/2. The difference between the 
number of edges and Max-Cut size is 12(1) for 7 < 1/2, while it is H(n) when 7 > 1/2 [CGHS04], The 
distribution of the Max-Cut size in the critical scaling window was determined in [DMRR12], In this paper, 
we work in the 7 -A 00 regime, so that both min-bisection and Max-Cut are 12 (n) asymptotically. 
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Diverse techniques have been employed in the analysis of minimum and maximum bisection for random 
graph ensembles. For example, [Bol 8 z ] used the Azuma-Hoeffding inequality to establish that 5 — < 

mcut(G Reg (n,7))/n < \ 

Spectral relaxation based approaches can also be used to bound these quantities. These approaches 
observe that the minimum and maximum bisection problem can be written as optimization problems over 
variables oy € {—1, +1} associated to the vertices of the graph. By relaxing the integrality constraint to an 
L 2 constraint the resulting problem can be solved through spectral methods. For instance, the minimum 
bisection is bounded as follows (here C {— 1 , +l} n is the set of (zbl)-vectors with = 0 , assuming 

for simplicity n even) 


mcut(G) 


min 


G Ec 



l min {g_ • (, C G o ;)} > ]- \ 2 {C G ) 

Z <j£lln Z 


( 1 . 1 ) 


Here C G is the Laplacian of G , with eigenvalues 0 = Ai (C G ) < A 2 [C-g) < ■ • • \ n {C G ). For regular graphs, 
using the result of [Fri03], this implies mcut(G Reg (n, y))/n > ^ — ^7 — 1- However for Erdos-Renyi graphs 
X 2 (£ G ) = o(l) vanishes with n [KKM06] and hence this approach fails. A similar spectral relaxation yields, 
for regular graphs, , MCUT(G Reg (n, y))/n < j + ^7 — 1, but fails for Erdos-Renyi graphs. Non-trivial 
spectral bounds on Erdos-Renyi graphs can be derived, for instance, from [FO05, CO07]. 

An alternative approach consists in analyzing algorithms to minimize (maximize) the cut size. This 
provides upper bounds on mcut(G') (respectively, lower bounds on MCUT(G')). For instance, [Alo97] 


proved that all regular graphs have mcut(G)/n < j — y for all n large enough (this method was 
further developed in [DSW07]). 

Similar results have been established for the max-cut problem on Erdos-Renyi random graphs. In a recent 
breakthrough paper, [BGT13] establish that there exists 7 ) such that MaxCut(G(n, [ 7 n]))/n A 7 ) 
and following upon it [GL14] prove that M^) € [ 7/2 + 0.47523^/7,7/2 + 0.55909^/7]. 

To summarize, the general flavor of these results is that if G is an Erdos-Renyi or a random regular graph 
on n vertices with [yn/2] edges, then mcut(G)/n = 7/4 — ©(y^y) while MCUT(G)/n and MaxCut(G)/n 
behave asymptotically like 7/4 + ©(y^y). In other words, the relative spread of cut widths around its 
average is of order 1/y^y. Despite 30 years of research in combinatorics and random graph theory, even 
the leading behavior of such a spread is undetermined. 

On the other hand, there are detailed and intriguing predictions in statistical physics— based mainly 
on the non-rigorous cavity method [MM09], which relate the behavior of these features to that of mean 
field spin glasses. From a statistical physics perspective, determining the minimum (maximum) bisection 
is equivalent to finding the ground state energy of the ferromagnetic (anti-ferromagnetic) Ising model 
constrained to have zero magnetization (see [PIG + 08] and the references therein). Similarly, the Max- 
Cut is naturally associated with the ground state energy of an anti-ferromagnetic Ising model on the 
graph. The cavity method then suggests a surprising conjecture [ZB10] that, with high probability, 
MCUT(G !Reg (n, 7 )) = MaxCut(G Reg (n, 7 )) + o(n) = 717/2 — mcut(G Reg (n, 7 )) + o(n). 

The present paper bridges this gap, by partially confirming some of the physics predictions and provides 
estimates of these features which are sharp up to corrections of order no(y 7 ). Our estimates are expressed 
in terms of the celebrated Parisi formula for the free-energy of the Sherrington Kirkpatrick spin glass, 
and build on its recent proof by Talagrand. In a sense, these results explain the difficulty encountered by 
classical combinatorics techniques in attacking this problem. In doing so, we develop a new approach based 
on an interpolation technique from the theory of mean field spin glasses [GT02, GT04, Tal03]. So far this 
technique has been used in combinatorics only to prove bounds [FL03]. We combine and extend these ideas, 
crucially utilizing properties of both the Poisson and Gaussian distributions to derive an asymptotically 
sharp estimate. 
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1.1. Our Contribution. To state our results precisely, we proceed with a short review of the Sherrington- 
Kirkpatrick (SK) model of spin glasses. This canonical example of a mean field spin glass has been studied 
extensively by physicists [MPV86], and seen an explosion of activity in mathematics following Talagrand’s 
proof of the Parisi formula, leading to better understanding of the SK model and its generalizations (c.f. 
the text [Panl3] for an introduction to the subject). 

The SK model is a (random) probability distribution on the hyper-cube {—l,+l} n which assigns mass 
proportional to exp (/3H SK (a_)) to each ‘spin configuration’ a € {—l,+l} n . The parameter /3 > 0 is 
interpreted as the inverse temperature, with H SK (-) called the Hamiltonian of the model. The collection 
{i7 SK (<j) : a € { — 1,+l} n } is a Gaussian process on {—l,+l} n with mean E[i/ SK (<j)] = 0 and covariance 
E{ H SK (a)H SK (a')} = (a - of) 2 . This process is usually constructed by 

1 71 

H SK (a ) = — — ^ JijWj , (1-2) 

with {Jij} being n? independent standard Gaussian variables, and we are mostly interested in the ground 
state energy of the SK model. That is, the expected (over {Jij}) minimum (over of), of the Gaussian 
process H SK (gf) introduced above. 

Definition 1.1. LePDp be the space of non-decreasing, right-continuous non-negative functions x : [0,1] —x 
[0,/3]. The Parisi functional at inverse temperature j3 is the function Pp : T>p —x R defined by 

1 f 1 

p p[x] = /(0,0;s) --J qx(q)dq, (1.3) 

where / : [0,1] x E x Vp —x R, ( q,y,x ) e-x f(q,y,x) is the unique weak solution of the PDE with boundary 
condition 

% + + =0, = (l/^) lo g( 2 c°sh(/3y)) (1.4) 

among all continuous functions f(q,y) such that ^ £ L 2 ([ 0,1] x R). 

The Parisi replica-symmetry-breaking prediction for the SK model is 

P*,/3 = inf{P /3 [x] : x € Vp} . (1.5) 

We refer to [JT15, Proposition 7] for the uniqueness of such a solution of (1.4), and to [AC14] for the 
strict convexity of x i—X P^fx], which implies the existence of a unique global minimizer of P*,/ 3 - We are 
interested here in the zero-temperature limit 

P* = lim P* i/3 , (1.6) 

p — y<oo 

which exists because the free energy density (and hence P*^, by [Tal06]), is uniformly continuous in 1//3. 
It follows from the Parisi Formula [Tal06], that 

lim n -1 E[max{i/ SK (u)}] = P* . (1.7) 

7i—> oo g_ 

The partial differential equation (1.4) can be solved numerically to high precision, resulting with the 
numerical evaluation of P* = 0.76321 ± 0.00003 [CR02], whereas using the replica symmetric bound of 
[Gue03], it is possible to prove that P* < ^2/n ~ 0.797885. 

We next introduce some additional notation necessary for stating our results. Throughout the paper, 
O(-), o(-), and ©(•) stands for the usual n —x oo asymptotic, while 0 7 (-), o 7 (-) and @ 7 (-) are used to 
describe the 7 —X 00 asymptotic regime. We say that a sequence of events A n occurs with high probability 
(w.h.p.) if P(H n ) -A 1 as n —X 00 . Finally, for random {X n } and non-random / : M + —x M + , we say 
that X n = o 7 (/(y)) w.h.p. as n —X 00 if there exists non-random g( 7 ) = o 7 (/( 7 )) such that the sequence 
A n = {|X n | < g{ 7 )} occurs w.h.p. (as n —x 00 ). 
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Our first result provides estimates of the minimum and maximum bisection of Erdos-Renyi random 
graphs in terms of the SK quantity P* of (1.6). 


Theorem 1.2. We have, w.h.p. as n —>• oo, that 

mcut(G(n, [yn])) 
n 

MCUT(G(n, [yn])) 
n 


|-P*y| + o 7 (Vy), 

(1.8) 

2 ~i~ 0r r(Vr)- 

(1.9) 


Remark 1.3. Recall the Erdos-Renyi random graph Gj(n,p n ), where each edge is independently included 
with probability p n . Since the number of edges in G/(n, is concentrated around yn, with fluctuations of 
0(n l / 2+£ ) w.h.p. for any e > 0, for the purpose of Theorem 1.2 the random graph Gj(n, has the same 
asymptotic behavior as G(n , [yn]). 


Remark 1.4. The physics interpretation of Theorem 1.2 is that a zero-magnetization constraint forces a 
ferromagnet on a random graph to be in a spin glass phase. This phenomenon is expected to be generic for 
models on non-amenable graphs (whose surface-to-volume ratio is bounded away from zero), in staggering 
contrast with what happens on amenable graphs (e.g. regular lattices), where such zero magnetization 
constraint leads to a phase separation. 


We next outline the strategy for proving Theorem 1.2 (with the detailed proof provided in Section 2). For 
graphs G = ( V ., E ), with vertex set V = [n] and n even, we write a G f l n if the assignment of binary variables 
a = (<ri,..., a n ), Gi € { — 1, +1} to V is such that Yliev = 0- We further define the Ising energy function 
Hq(g) = i and let U-{G) = min {Hg(g) : a G O n }, 17+(G) = ma x{Hg(g) : a G O n }. It is 

then clear that 

mcut(G) = \\E\ + \ U~(G) , MCUT(G) = \\E\ + \ U+(G). (1.10) 

In statistical mechanics a is referred to as a ‘spin configuration’ and U-(G) (respectively U+(G)), its 
‘ferromagnetic (anti-ferromagnetic) ground state energy.’ 

The expected cut size of a random partition is taken care of by the term \\E\, whereas standard 
concentration inequalities imply that U + (G) and U—(G) are tightly concentrated around their expectation 
when G is a sparse Erdos-Renyi random graph. Therefore, it suffices to prove that as n —>• oo all limit 
points of n _1 E[t/-i-(G)] are within o 7 (yGy) of ±P*y / 27 . Doing so is the heart of the whole argument, and 
it is achieved through the interpolation technique of [GT02, GT04], Intuitively, we replace the graph 
G by a complete graph with random edge weights Jij/y/n for Rj independent standard normal random 
variables, and prove that the error induced on U±(G) by this replacement is bounded (in expectation) by 
nofly/fl). Finally, we show that the maximum and minimum cut-width of such weighted complete graph, 
do not change much when optimizing over all partitions erG {—1, +l} n instead of only over the balanced 
partitions a G Tt n . Now that the equi-partition constraint has been relaxed, the problem has become 
equivalent to determining the ground state energy of the SK spin glass model, which is solved by taking 
the ‘zero temperature’ limit of the Parisi formula (from [Tal06]). 

The next result extends Theorem 1.2 to y-regular random graphs. 


Theorem 1.5. We have, w.h.p. as n —>• oo, that 

mcut(G Reg (n, 7 )) 
n 

MCUT(G Reg (n, 7 )) 

n 


i~ p *'\Zi + ° 7 ^ ) ’ 

(1.11) 

1 + p *\[~i +0 

(1.12) 


The average degree in an Erdos-Renyi graph G(n, [yn]) is 2y so Theorems 1.2 and 1.5 take the same 
form in terms of average degree. However, moving from Erdos-Renyi graphs to regular random graphs 
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having the same number of edges is non-trivial, since the fluctuation of the degree of a typical vertex in an 
Erdos-Renyi graph is © 7 ( 7 / 7 )- Hence, any coupling of these two graph model yields about n^J 7 different 
edges, and merely bounding the difference in cut-size by the number of different edges, results in the too 
large spread. Instead, as detailed in Section 3, our proof of Theorem 1.5 relies on a delicate construction 
which “embeds” an Erdos-Renyi graph of average degree slightly smaller than 7 , into a 7 -regular random 
graph while establishing that the fluctuations in the contribution of the additional edges is only no 7 ( v / 7 ). 

Our next result, whose proof is provided in Section 4, shows that upto the first order, the asymptotic of 
the Max-Cut matches that of the Max bisection for both Erdos-Renyi and random regular graphs. 


Theorem 1.6. 

(a) W.h.p. as n —>• 00 , we have, 

MaxCut(G(n, [yn])) 
n 


(b) W.h.p. as n —>• 00 , we have, 

MaxCut(G Reg (?7,, 7 )) 
n 


\ + P *\[^ + °^)- 
4 + P *\/f + 


1 . 2 . Application to community detection. As a simple illustration of the potential applications of our 
results, we consider the problem of detecting communities within the so called ‘planted partition model’, 
or stochastic block model. Given parameters a > b > 0 and even n, we denote by Gi(n,a/n,b/n) the 
random graph over vertex set [n], such that given a uniformly random balanced partition [n] = V\ U V 2 , 
edges ( i,j ) are independently present with probability a/n when either both i,j € V\ or both i.j € V 2 , 
or alternatively present with probability b/n if either i £ V\ and j € V 2 , or vice versa. Given a random 
graph G, the community detection problem requires us to determine whether the null hypothesis Hq : G ~ 
G/(n, (a + 6)/(2n)) holds, or the alternative hypothesis Hi : G ~ G/(n, a/n, b/n) holds. 

Under the alternative hypothesis the cut size of the balanced partition (Vi,V^) concentrates tightly 
around nb/ 4. This suggests the optimization-based hypothesis testing 


T cut {G-e) 


0 if mcut(G) < 9, 
1 otherwise 


(1.13) 


and we have the following immediate consequence of Theorem 1.2. 


Corollary 1.7. Let 9 n = (6/4) + e n with e n y/n —>• 00 . Then, the test T cut ( •; 9 n ) succeeds w.h.p. as n 00 , 
provided (a — b) 2 > 8P^(a + 6 ) + o(a + b). 

Let us stress that we did not provide an efficient algorithm for computing T cut (but see [?] for related work 
that uses polynomially computable convex relaxations). By contrast, there exist polynomially computable 
tests that succeed w.h.p. whenever (a — b) 2 > 2 (a + 6 ) and no test can succeed below this threshold (see 
[DKMZ11, MNS13, Masl4]). Nevertheless, the test T cut is so natural that its analysis is of independent 
interest, and Corollary 1.7 implies that T cut is sub-optimal by a factor of at most 4P^ ~ 2.33. 


2. Interpolation: Proof of Theorem 1.2 

The Erdos-Renyi random graph G(n, m) considers a uniformly chosen element from among all simple 
(i.e. having no loops or double edges), graphs of n vertices and m edges. For m = [ 777 ,] and 7 bounded, 
such simple graph differs in only 0 ( 1 ) edges from the corresponding multigraph which makes a uniform 
choice while allowing for loops and multiple edges. Hence the two models are equivalent for our purpose, 
and letting G(n, [yn]) denote hereafter the latter multigraph, we note that it can be constructed also by 
sequentially introducing the [yn] edges and independently sampling their end-points from the uniform 
distribution on {1, • • - ,n}. We further let G R “ SS denote the Poissonized random multigraph G(n,N n ) 
having the random number of edges N n ~ Pois(yn), independently of the choice of edges. Alternatively, 
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one constructs by generating for 1 < i,j < n the i.i.d Zij ~ Pois(^) and forms the multi-graph 

on n vertices by taking (+ Zji) as the multiplicity of each edge / j (ending with multiplicity 

Z(ij) ~ Pois(^) for edge (i,j), i / j and the multiplicity ~ Pois(^) for each loop ( i,i ), where 
{z(j j), i < j, are mutually independent). By the tight concentration of the Poison) law, it suffices to 
prove Theorem 1.2 for G>)“ ss , and in this section we always take for G n a random multi-graph distributed 

o C /^Poiss 

aS u n ,7 * 


2.1. Spin models and free energy. A spin model is dehned by the (possibly random) Hamiltonian 
H : {—l,+l} ri —» R and in this paper we often consider spin models constrained to have zero empirical 
magnetization, namely from the set = {g_ £ {— l,+l} n : Ya= 1 Gi = 0}- The constrained partition 
function is then Z n (/3) = Y2aen n e~^ H ^ with the corresponding constrained free energy density 


4>n{P) = ~ E[logZ n (/3)] 
n 


1 

- E 

n 


log{ x: 


( 2 . 1 ) 


The expectation in (2.1) is over the distribution of the function ■) (i.e. over the collection of random 
variables {H(a)}). Depending on the model under consideration, the Hamiltonian (or the free energy) 
might depend on additional parameters which we will indicate, with a slight abuse of notation, as additional 
arguments of (•). 

For such spin models we also consider the expected ground state energy density 

e n = — E[ min H(a) ], (2.2) 

Tl 

which determines the large-/3 behavior of the free energy density. That is, </> n (/3) = ~/3e n + o(j3). We 
analogously define the maximum energy 

e n = — E[ max H(a) 1, (2-3) 

n aen n 

which governs the behavior of the free energy density as f3 —>• —oo. That is, (j> n (P) = —/3e n + o(/3) (in 
statistical mechanics it is more customary to change the sign of the Hamiltonian in such a way that /3 is 
kept positive). The corresponding Boltzmann measure on is 

M&nteO = YJj3) eX P{-P H (^)i • ( 2 - 4 ) 

A very important example of a spin model, that is crucial for our analysis is the SK model having the 
Hamiltonian H SK (-) of (1.2) on { — 1, +l} n and we also consider that model constrained to f2 n (i.e. subject 
to zero magnetization constraint). 

The second model we consider is the ‘dilute’ ferromagnetic Ising model on G>)°,( ss = (V, E ), corresponding 
to the Hamiltonian 

= ~ ai(T i ’ 

again restricted to a G Q n . We use superscripts to indicate the model to which various quantities refer. 
For instance ^>® K (/3) denotes the constrained free energy of the SK model, <^°(/3; 7 ) is the constrained free 
energy of the Ising model on G^“ ss , with analogous notations used for the ground state energies e® K and 
e°(7)- 

The first step in proving Theorem 1.2 is to show that mcut(G n ) and MCUT(G n ) are concentrated around 
their expectations. 
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Lemma 2.1. Fixing s > 0, we have that 

P mcut(G n ) — E[mcut(G n )] 

P M 


> ns 

> ns 


= 0(l/n) 
= 0(l/n). 


MCUT(G n ) -E[MCUT(G n )] 

Proof. Recall (1.1) that mcut(G n ) = \\E n \ + | U-(G n ), with \E n \ = N n ~ Pois([ 7 n]). Therefore, 

P[|mcut(G n ) -E[mcut(G„)]| > ns] < P[|E7_(G„) - E[C/_(G„)]| > rae] + P[|iV n - ElV n | > n s\ 

Var(C/_(G n )) , Var(iV n ) Var(C/_(G n )) 

— + 0(1/77,). 


< 


n 2 e 2 

We complete the proof for mcut(G n ) by showing that Var (U-(G n )) < ny. Indeed, writing U-(G n ) = /(z) 
for z = {zjj, 1 < z, j < n} and i.i.d. z V] ~ Pois( 7 /ra), we let denote the vector formed when replacing 
in z by an i.i.d copy zh. Clearly |/(z) — /(z( lJ ))| < |zjj — 2 ^-|. Hence, by the Efron-Stein inequality 
[BLM13, Theorem 3.1], 

Var (U-(G n )) < ^ J>[(/(z) - /(z^)) 2 ] < ^E[( % - 4) 2 ] , 

id 

yielding the required bound (and the proof for MCUT(G n ) = \N n + \U + (G n ) proceeds along the same 
line of reasoning). □ 

Next, recall that \E n \ ~ Pois( 7 n) has expectation yn, while e % = n^ 1 E[?7_(G n )] and 4 = ^ _1 E[C/ + (G ri )] 
(see (2.2) and (2.3), respectively). Hence, from the representation (1.1) of mcut(G n ) and MCUT(G n ), we 
further conclude that 


+ 


l2tr2 


i2 r 2 


—E[mcut(G n )] = J + ^44) 
n 2 2 


lE[MCUT(G n )] = J + ^( 7 ). 

n 2 2 


( 2 . 6 ) 


oo, 


(2.7) 

( 2 . 8 ) 


Combining (2.6) with Lemma 2.1, we establish Theorem 1.2, once we show that as n 

4 ( 7 ) = -V^ P * + o 7 ( v / 7 ) + °(4 
44) = + 427 P* + o 7 (v^y) + o(l). 

Establishing (2.7) and (2.8) is the main step in proving Theorem 1.2, and the key to it is the following 
proposition of independent interest. 

Proposition 2.2. There exist constants A±,A 2 < 00 independent of n, f3 and 7 such that 

P 


44’ 


,7 -<04 


< 71, 


^ + A 2 t 

y/T 7 


(2.9) 


We defer the proof of Proposition 2.2 to Subsection 2.2, where we also apply it to deduce the next 
lemma, comparing the ground state energy of a dilute Ising ferromagnet to that of the SK model, after 
both spin models have been constrained to have zero magnetization. 


Lemma 2.3. There exist A = A( 70 ) finite, such that for all 7 > 70 and any n, 


4 ( 7 ) 


\/27 


— e, 


SK 


< H 7 


- 1/6 


4 ( 7 ) 


oSK 


44 


< A'y 


- 1/6 


( 2 . 10 ) 


In view of Lemma 2.3, we get both (2.7) and (2.8) once we control the difference between the ground state 
energies of the unconstrained and constrained to have zero magnetization SK models. This is essentially 
established by our following lemma (whose proof is provided in Subsection 2.3). 

Lemma 2.4. For any 5 > 0, w.h.p. 0 < t/® K — U 4 < n^ +s , where 

tSK 


U7= min {H SK (a)}, 


U7= min{tf SK (a)}. 

crESZ n 


( 2 . 11 ) 
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Indeed, applying Borel’s concentration inequality for the maxima of Gaussian processes (see [BLM13, 
Theorem 5.8]), we have that for some c > 0, all n and 5 > 0, 


Uf-E[uf] 


> nS 


< 2e 


—cn8 2 


P[|l/S K -E[C/ n SK ]| >n6] <2e 


—cnS 2 


( 2 . 12 ) 

(2.13) 


Recall that e® K = n _1 E[I/® K ], whereas n^ 1 E[t/^ K ] —> — P* by (1.7). Consequently, the bounds of (2.12), 
(2.13) coupled with Lemma 2.4 imply that e® K -A — P* as n A oo. This, combined with Lemma 2.3 and 
(2.6), completes the proof of Theorem 1.2. 


2.2. The interpolation argument. We first deduce Lemma 2.3 out of Proposition 2.2. To this end, 
we use the inequalities of Lemma 2.5 relating the free energy of a spin model to its ground state energy 
(these are special cases of general bounds for models with at most c n configurations, but for the sake of 
completeness we include their proof). 


Lemma 2.5. The following inequalities hold for any n, j3 ,7 > 0: 


e°(7) + ^n(A7) 


Further, for any n, /3 < 0, 7 > 0, 

1 


6n(7) + ^Pn(P,T) 


< 


< 


log 2 

P ’ 

log 2 

\P\ ’ 


e SK + 


0 SK 


1 


/SK 


(P) 


(P) 


< 


< 


log 2 

P 

log 2 

\P\ 


(2.14) 


(2.15) 


Proof. Let H n (a) be a generic Hamiltonian for a € Ll n . One then easily verifies that 

9 ( MP) \ _Lfiu r r lQ g 2 ol 

dp V P ) n ^ E i s ^h,n)} e [ £ 2 ’°]’ 

for the Boltzman measure (2.4) and the non-negative entropy functional 5(/r) = — l°g Mil) 

which is at most log|H n |. Further, comparing (2.1) and (2.2) we see that P~ 1 p n (P) -A —e n when (5 -A 00 
(while n is fixed). Consequently, for any j5 > 0, 


6n + 


Pn(P) 

p 



(LLi) in <k|l 

\ u ) p 


We apply this inequality separately to the SK model and the diluted Ising model to get the bounds of 
(2.14). We similarly deduce the bounds of (2.15) upon observing that P^ 1 (f> n (P) -A — e n when p -a —00 
and recalling that with {lL® K (a)} a zero mean Gaussian process, necessarily e^ K = — e® K . □ 


Proof of Lemma 2.3. Clearly, for any n, P > 0 and 7 > 0, 


e °(7) sk 

< 

n 



1 




e£(7) + 4C( 


^ ^ A 7) 




+ 


1 


P^-P 


^ A 7) 


’( 


V27’ 


+ 


e SK + 


iSK 


/3' 


(/?) 


In view of (2.14), the first and last terms on the RHS are bounded by (log 2 )/ P- Setting p = 7 1 / 6 , we deduce 
from Proposition 2.2 that the middle term on the RHS is bounded by + H 27 _1 ^ 2 , yielding the first 

(left) bound in (2.10) (for A = log 2 + A\ + T 27 0 ). In case P < 0, starting from 


6n(7) . SK 


V^7 


< 


1 


^^(7) + 


+ 


^n K (P) 


+ 


6f - ~/*{P) 


and using (2.15), yields the other (right) bound in (2.10). 


□ 
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Proof of Proposition 2.2. For t S [0,1] we consider the interpolating Hamiltonian on 

H n h,t,a) := -^=H^ { 1 _ t) {g_) + \TtH SK (g _), 


(2.16) 


denoting by Z n (/3, 7 , t), 4> n (f3 , 7 , t) and pp, n ( m ', 7 , t), the partition function, free energy density, and Boltz¬ 
mann measure, respectively, for this interpolating Hamiltonian. Clearly, <j> n (f}, 7 ,0) = <^(^=, 7 ) and 
4>n(/3,l, 1 ) = 0n K (/3)- Hence, 


( 7 ^ 7 ’ 7 )“^ K ( /3 ) 


< 


f 

Jo 


d(j> n 


dt 


(13,1 ,t) 


dt 


and it suffices to show that |^p| is bounded, uniformly over t € [0,1] and n, by the RHS of (2.9). To this 
end, associate with i.i.d. configurations {gf,j > 1 } from pp iU (-\ 7 , t) and £> 1 , the multi-replica overlaps 


Qt — 


n 


i =1 


E(n^ 

3 = 1 


Then, denoting by (• )t the expectation over such i.i.d. configurations {of, j > 1}, it is a simple exercise 
in spin glass theory (see for example [FL03]), to explicitly express the relevant derivatives as 


dcfn 

dt 


(I3,l,t) 


dcfn 

dt 

d<f> , 


SK 


dt 


( df>n \ ( d(f> n \ 

[dt ) SK + {dt J D ’ 

E(1_e [<Ql>,]). 

tanh-)h) E[<Q|> S ]. 


—7 log cosh 


(3 


V*y 


+iJ2 


(-1 y 


1 =1 


(2.17) 

(2.18) 


For the reader’s convenience, we detail the derivation of (2.17) and (2.18) in Subsection 2.4, and note in 
passing that the expressions on their RHS resemble the derivatives of the interpolating free energies obtained 
in the Gaussian and dilute spin glass models, respectively (see [GT02], [GT04]). 

Now observe that \Qt\ < 1 for all £ > 2 and Q\ = 0 on Q n , hence 


d(j) n 

dt 


(f3,l,t) 


< 7 


log cosh 



t +1 

4 7 2 



2 7 


+ 7E7 


1 =3 


tanh 


P 


V^l 


The required uniform bound on | | is thus a direct consequence of the elementary inequalities 

| log cosh ir- ^x 2 \ < Cix 4 , \y 2 - x 2 \ < C 2 x 4 , | - log(l - y) - y - ^y 2 \ < C' 3 |a;| 3 , 

which hold for some finite C 1 , C 2 , C 3 and any y = | tanh x \. □ 


2.3. Proof of Lemma 2.4. Recall that H SK (a) = — Jcr w ^ere J = {Jij = (Jij + Jji)/V 2 : 1 < 

i , j < n} is a GOE matrix. Since {Jo} do not affect [7® K — t/ n K , we further set all diagonal entries of J to zero. 
By symmetry of the Hamiltonian H SK (•), the configuration a* that achieves the unconstrained ground 
state energy H SK (a*) = t/^ K is uniformly random in {—1, +1} T1 . Therefore, S* := | Ya =1 is a centered 
Bin(n, 1/2) random variable, and by the LIL the events B n = {|S*| < b n } hold w.h.p. for b n := y/n log n. 
By definition U n > — §A max (J/y / n), hence the events C n = {U n > —2 n} also hold w.h.p. by the 
a.s. convergence of the largest eigenvalue A max (-) for Wigner matrices (see [AGZ09, Theorem 2.1.22]). 
Consequently, hereafter our analysis is carried out on the event {B n nC„} and without loss of generality 
we can and shall further assume that 5* > 0 is integer (since n is even). 
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; E 


Since a* is a global minimizer of the quadratic form H SK (a ) over the hyper-cube {—1, l} n , necessarily 
of = sign(/f) for 

/* ~ — 

* 

Consequently, under the event C n , 

n n 

-2 n < Vf = H SK (a*) = - £ a*f* = - £ |/*| , 

2=1 2=1 

hence i?* := {i € [n] : |/*| < 6} is of size at least (2/3)n. Thus, for n > 6 b n , under the event B n n C n we 
can find a collection VC* C {i £ R* : of = +1} of size 5* and let g_ € be the configuration obtained by 
setting ai = —a* = — 1 whenever i € VC* while otherwise cij = a*. We obviously have then that 

ttSK 


2 4 2 
Z7r = H SK {a*) < U* K < H SK (a) 
Further, by our choices of g_ and VC* C R*, also 

2 

Ri 


(2.19) 


H bK (a) - H tiK {a*) = 


E E * 

i€-W* j^[rx]\ 


v a j 


< 


4 E i/*i + -4 A (^*) ^ 245 - + -i A (*n 

iew* y ' n 


( 2 . 20 ) 


where we define, for VC C [n] the corresponding partial sum 

A(W):= £ 141, 

of (I” ^) i.i.d. variables 4 . Under the event B n we have that S* < b n < y n := so by (2.19) and 

(2.20) it suffices to show that w.h.p. (A(VC*) < x n } for x n = \Jny n . To this end, note that by Markov’s 
inequality, for some c > 0, all n and any fixed VC of size |VC| < b n , 

P(A(VC) > x n ) < e~ Xn E[e^f n < e~ CXn . 

With at most 2 n such VC C [n], we conclude that 

P(sup{A(VC) : VC C [n], |VC| < b n } < x n ) -A 1, 
and in particular w.h.p. {A(VC*) < x n } (under B n = {5* < 6 n }). 


2.4. The interpolation derivatives. Recall the Hamiltonian Hn( 7 ;G<l) of (2.16), the corresponding 
partition function Z n (/3, q,f) and free energy density 4> n (f3,'y,t). We view n~ 1 log Z n ((3, 7 , t) := ip n (t,z, J), 
as a (complicated) function of the Gaussian couplings J = {4 : 1 < i,j < n} and the Poisson multiplicities 
z = {zij : 1 < i,j < n}. Denoting by p(t, •) the Pois( 7 (l — t)/n ) probability mass function (pmf) of Zij 
yields the joint PMF p(f,z) = Yli<i j< n P(^Rij)^ an d tlm expression 


where y = 
M. Thus, 


4 (/ 3 , 7 ,t) = E[ 4 (t,z, J)] = J ip n (t, z, J)p(t, z)d/i(z, J) 


( 2 . 21 ) 


(z/pj)” < 8 > (z^r)” for the counting measure z-tj on N and the standard Gaussian measure v r on 


d(k 

dt 


= J -fiftt’Z, J)p(^,z)d/r(z, J) + J 4 (t,z, J)^(t,z)d/r(z, J) 
d<t> r 


\ f dffin 

dt Js K V dt /D 


( 2 . 22 ) 
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Proceeding to verify the expression (2.17), here = ~^H SK (since i7^ 1 _^(-) depends on t only through 
the PMF of z). Hence, 


| [log ZM 7 , t)] = ,*,£)), = -i^( ffSK ( 2 )), • 




resulting with 


St 


1 P 


sk n 2y/t 


E Z (E j[(H SK (a)) t ]) . 


The expression on the RHS of (2.17) then follows by an application of Gaussian integration by parts to 
Ej[(i 7 SK (< 7 ))i], as illustrated for example in [Panl3, Lemma 1.1]. 

Next, to establish (2.18) let hij(zij) := E[i/> n (f, z, J)|zy], and note that the product form of p(t, z) and 
/r(z, J), results with 



(2.23) 


The iji-th integral on the RHS of (2.23) is merely the value of (—'y/n)g > (\), where g(X) = E[/(z)] for 
/ = hij and z ~ Pois(A) at A = 7(1 — t)/n. Differentiating the Pois(A) PMF one has the identity g'( A) = 
E [f(z + 1) — f(z)] (under mild regularity conditions on /). This crucial observation transforms (2.23) into 

(ir) + ( 2 - 24 ) 

' ' D i =1 j —1 


Here i/) n (t, •, ■) = n 1 log Z n (fj, 7 , t) and adding one to z t j corresponds to an extra copy of the edge (i,j) in 
the dilute Ising model of Hamiltonian -^==77 ^_^(ct). Consequently, setting b := 

hij(zij + 1) - hij(zij) = - log /e brTi<Tj \ = - log | cosh(6) [l + tanh (b)(aiaj) t \ } , (2.25) 

since e by = cosh(6)[l + tanh(6)y] for the {—1, +l}-valued y = OjOj. Combining (2.24) and (2.25), we obtain 
by the Taylor series for — log(l + x) (when — 1 < x < 1), that 


dp n 

dt 


D 


n‘ 


EE E [log {cosh ( 6 ) [1 + t&nh(b)(criaj)t]}} 


i= 1 j =1 


-7 log cosh (7) + 7 ^2 


(-i)' 


e =1 

00 


tanh(m E 


< r 1 


^2 S 


*,i =1 


= -7 


log cosh( 6 ) + 7 ^2 —r~ (tanh(&)) E 


1 = 1 


as stated in (2.18). 


3. Graph Comparison: Proof of Theorem 1.5 

The notion of uniform random 7 -regular graph refers to drawing such graph uniformly from among all 
7 -regular simple graphs on n-vertices, provided, as we assume throughout, that 717 is even. We instead 
denote by G Rcg (n, 7 ) the more tractable configuration model, where each vertex is equipped with 7 half¬ 
edges and a multigraph (of possible self-loops and multiple edges) is formed by a uniform random matching 
of the collection of all 771 half-edges. Indeed, as mentioned in the context of Erdos-Renyi graphs (see start 
of Section 2), for 7 bounded the matching in G Reg (n, 7 ) produces a simple graph with probability bounded 
away from zero, and conditional on being simple this graph is uniformly random. Consequently, any 
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property that holds w.h.p. for the configuration model multigraph G Reg (n, 7 ) must also hold w.h.p. for 
the simple uniform random 7 -regular graph. 

Our strategy for proving Theorem 1.5 is to start from the random regular multigraph G 1 G Reg (n, 7 ), 
deleting some edges and “rewiring” some of the existing ones to obtain a new graph G 2 which is approx¬ 
imately an Erdos-Renyi random graph of n 7_/2 edges, where 7 _ := 7 — ^7 log 7 . Then, with Theorem 
1.2 providing us with the typical behavior of extreme bisections of G 2 , the main challenge is to control the 
effect of our edge transformations well enough to handle the minimum and maximum bisections of G\. 

Specifically, drawing i.i.d. Xi ~ Pois( 7 _), we let Zj := (7 — Xf) + and color Z* of the 7 half-edges of each 
vertex i € [n] by BLUE (B). All other half-edges are colored RED (R). Matching the half-edges uniformly, 
without regard to their colors, we obtain a graph G\ ~ G Reg (n, 7 ). Our coloring decomposes G\ to the 
sub-graph Grr consisting of all the RR edges and Grb U Gbb having all other edges, which we in turn 
decompose to the sub-graph Gbb consisting of the BB edges and Grb having all the multi-color edges (i.e. 
RB and BR). To transform G\ to G 2 , we first delete all edges of Gbb 5 disconnect all the multi-colored RB 
edges and delete all the B half-edges that as a result became unmatched. We then form a new sub-graph 
Grr by uniformly re-matching all the free R half-edges (in case there is an odd number of such half-edges we 
leave one of them free as a self-loop). The graph G 2 has the vertex set [n] and E(G2) = E(Grr) U E(Grr). 

We represent by £l n the collection of all bisections for a graph G having n vertices, denoting by cutc(<l) 
the cut size for the partition between {i € [n] : = — 1} and its complement. Then, for any a € £l n we 

have 


cutci (a) = cutG 2 (oi) - cutg RR (u) + cutG RB uG B B (z.) ■ (3.1) 

We control the LHS of (3.1) by three key lemmas, starting with the following consequence of Theorem 1.2, 
proved in Subsection 3.1 that gives sharp estimates on the dominant part, namely cutG 2 («l)- 


Lemma 3.1. We have, w.h.p. as n —>• 00 , 

mcut(G2) 

n 

MCUT (G 2 ) 
n 


■j" _ p * \j\ + 0r y (\/t) > 

(3.2) 


(3.3) 


Our next lemma, proved in Subsection 3.2, shows that while both the B half-edge deletions and the R 
half-edge re-matching that follows, may affect the cut size, on the average (with respect to our random 
matching), at the scale of interest to us they cancel out each other. 


Lemma 3 . 2 . Uniformly over all a_ € Q n , 

E[cutG RB (<l)] = n ^ v^ogT + <3^^ + 0 ( n ) ; (3.4) 

®[ cut G RR (—)] = n ( V ^^° g7 + + °( n ) > ( 3 - 5 ) 

E[cutG BB (oi)] = n + o 7 (l)^ + o{n). (3.6) 

The last result we need, is the following uniform bound on the fluctuations, proved in Subsection 3.3, 
that allows us to control the effect of the edge rewiring on the extremal bisections. 


Lemma 3.3. There exists C sufficiently large, independent of n and 7 , such that 

sup |cut, 4 (a) - E[cut^(a)]| > Cn^y 1 ^ \/log 7 = o(l) 


(3.7) 


where A may be distributed as Grb U Gbb or Grr. 
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Turning to prove Theorem 1.5, we have from (3.1) and Lemma 3.3 that w.h.p. as n A oo, 


sup 

<xgf2 n 


cut Gl (a) - cut Ga (a) + E[cutg RR ( ct)] - E[cut GRBUGeB (a;)] = no 7 ( v / 7 ). 


In view of Lemma 3.2, we deduce from (3.8) that w.h.p. as n -A 00 , 

V7 1 °g7 


sup 


cut Gl (ct) — cut Ga (a) — n- 


= + o(n). 


This in turn implies that w.h.p. 


mcut(Gi) = mcut(G 2 ) + n^ 1 ' g7 + no 7 (y/ 7 ) + o(n ), 
MCUT(Gi) = MCUT(G 2 ) + n ^ lQg7 + nay(y/j) + o(n), 


(3.8) 


and Theorem 1.5 thus follows from Lemma 3.1 (recall that 7 = 7 - + ^7 log 7 ). 


3.1. Proof of Lemma 3.1. Let G ™ 1 be the random graph generated from the configuration model with 

1.1. d. Xi ~ Pois( 7 _) degrees. We denote by G clon ( n > 7 -) the sub-graph obtained by independently deleting 
each half-edge of G 1 ^ with probability 1 /n, before matching them. By the thinning property of the Pois law, 
G clon (n, 7 _) has the law of the Poisson-Cloning model, where one first generates i.i.d. Q ~ Pois(^^ 7 _), 
then draws a random graph from the configuration model with Q half-edges at vertex i. Recall [Kim06] that 
the G/(n, ^-) and G clon (n, 7 _) models are mutually contiguous. Further, 7-/7 —>• 1, and so by Theorem 

1 . 2 . w.h.p. 


mcut(G clon (n,7_)) 

n 

MCUT(G clon (n, 7 _)) 

n 


x + p *\j\ + ■ 


(3.9) 

(3.10) 


Next note that for any two graphs Qi,Q 2 on n vertices, |MCUT(<5i) — MCUT(C/ 2 | < |£ , (C?i)A£ , (C/ 2 )| and 
|mcut(C/i) — mcut(t? 2 | < \E(Qi)AE(Q 2 )\- W.h.p. our coupling has Xa(X — C i) = 0(1) half-edges from G ™ 1 
not also in G clon (n, 7 _). Hence |l?(G“ t )AE(G clon (n, 7 _))| = 0(1) and (3.9)-(3.10) extend to mcut(G™ t ) 
and MCUT(G“ t ), respectively. 

We proceed to couple G™* and G 2 such that |E(G(f t )A£ , (G 2 )| < no 7 ( v / 7 ) w.h.p. thereby yielding 
the desired conclusion. To this end, G 2 could have alternatively been generated by one uniform random 
matching of only the X[ := min{Xj, 7 } RED half-edges that each vertex i has in Gi (for completeness, 
we prove this statement in Lemma 3.4). We can thus couple G 2 and G™* by first forming G™*, then 
independently for i = 1,... ,n color in RED uniformly at random X\ of the X{ half-edges of vertex i, with 
all remaining half-edges colored BROWN. Now, to get G 2 we delete all BB edges, disconnect all RB edges 
and delete the resulting B half-edges, then uniformly re-match all the free R half edges (for Lemma 3.4 
applies again in this setting). The claimed bound on |E(G(( lt )AI?(G 2 )| follows since the total number of B 
half-edges in G ™ 1 is w.h.p. at most 

2nE(Ax - X[) = 2nE[(X 1 - 7 )+] = n0 7 (l), (3.11) 

where the RHS follows by Normal approximation to Pois( 7 _) and our choice of 7 _ = 7 — 7 / 7 log 7 - 


3.2. Proof of Lemma 3.2. We first prove (3.4), utilizing the fact that the distribution of cut GRB (a) is 
the same for all a £ H n . Hence, 


E[cut GRB (a)] = E [ E 2 * [cut GRB (a*)] ] 


(3.12) 
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for er* chosen uniformly from £l n . Given the graph G\, we have 

\E rb \ 


Eo.*[cutG RB (a*)] = 


2(1 - l/n) 


(3.13) 


where E^b denotes the set of RB edges in G\ excluding self-loops. Next, noting that the expected number 
of edges in G\ excluding self-loops is and the probability that an edge connecting two distinct 


vertices is coloured RB is 2 ^^(1 — we have, 


E 


(3.14) 


L J n'y — 1 7 V 7 / 

where Z i ~ (7 — X\) + and X\ ~ Pois( 7 _). We get (3.4) out of (3.12) and (3.14) upon observing that 

E[Zi] = 7 - EpG] + E[(7 - Xi)_] = 7 - 7 _ + E[(Xi - 7)+] = y/ylogy + 0 7 ( 1) (3.15) 

(see (3.11) for the right-most identity). By an analogous calculation, we find that for all a € £l n , 


E[cut GBB (£)] = 


n(n — l)y 2 /E[Zi] \ 2 


= ^[T(log7 ) 2 + o 7 ( 1 )] +o{n). 


2 (nq — 1) V 7 J 2(1 - l/n) 

Turning to (3.5), the same argument as in (3.12) implies that 

E [ Cut G RR ( 2 )] = E [ E -* [ cut G RR ( 2 *)] ] » 

for <7* chosen unifomly from £l n . Further, similarly to (3.13) we find that given the graph Grr, 


E, 


E r[^t s m (g*)] = 2(1 - 1/n) 


(3.16) 


where E 2 denotes the set of edges in Grr excluding self-loops. Recall that |E(Grb)| — |Erb| and ^|2£(Grb)| — 
IE 2 I count the number of self-loops in Grb and Grr, respectively. The expected number of such self-loops 
is 0(1) as n A 00 , hence E[|E 2 |] = ^E[|Erb| ] + 0(1), which upon comparing (3.13) to (3.16) yields the 
required expression of (3.5). 


3.3. Proof of Lemma 3.3. Starting with A = Grb U Gbb clearly, for any x n > 0, 

P sup |cut_ 4 (a) - E[cut^(a)]| > 2 x n < p\{n) + P 2 (n) 

L cr£Q n 

where Z = (Z\, ■ ■ ■ , Z n ) count the number of BLUE half-edges at each vertex of G\ and 

Pi(n) = P sup |cut_ 4 .(a) — c(a, Z)\ > x r 

P 2 {n) = P sup |c(ct, Z) — E[cut^(a)] | > x r 

1 1 


for c(< 7 , Z) := E[cut_ 4 (a)|Z], Letting S n ( Z) = Y^i=i %%■, note that w.h.p. Z G £ n for £ n 
nE[Zi] | < b n } and b n = \Jn log n. Hence, by a union bound over a € £l n we get that 


Pi (n) G 2” max max ] 

zeSn 71 


|cut^(a) - c(a,Z)| > x r . 


Z = 


+ o(l). 


(3.17) 


(3.18) 

(3.19) 
{z : \S n ( z) - 


(3.20) 


We next apply Azuma-Hoeffding inequality to control the RHS of (3.20). To this end, fixing z € £ n and 
half-edge colors such that {Z = z}, we form G 1 by sequentially pairing a candidate half-edge to uniformly 
chosen second half-edge, using first BLUE half-edges as candidates for the pairing (till all of them are 
exhausted). Then, fixing a € f l n , we consider Doob’s martingale M *. = E[cutyi(o)|^'fc], for the sigma- 
algebra generated by all half-edge colors and the first k > 0 edges to have been paired. This martingale 
starts at Mq = c(a, Z), has differences |M& — Mfc_i| uniformly bounded by some universal finite non- 
random constant k (independent on n, g_ and z), while Mi = cutler) for all t > S n ( z) (since the sub-graph 
A = Grb U Gbb is completely formed within our sequential matching first S n ( z) steps). The bounded 
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difference property of M follows easily from the “switching” argument in [Wor99, Theorem 2.19]. Thus, 
from Azuma-Hoeffding inequality we get that for z £ £ n , 


|cut. 4 (a) — c(a, Z)| >x r 


Z = z 


< 2 exp — 


< 2 exp — 


8« 2 (nE[Zi] + b n 


(3.21) 


8K 2 S n (z) 

Recall (3.15) that E[Zi] = ^/ylogy + 0 7 (1), hence choosing x n = Cny^Vlogy for some C 2 > 8 k 2 log3, 
we find that the RHS of (3.20) decays to zero as n —> oo. 

Turning to control P 2 (n), for i € [n] and 1 < j < Z{, let I %3 (a) = 1 if the j th B half-edge of vertex i is 
matched to some half-edge from the opposite side of the partition induced by a, and Iij{o _) = 0 otherwise. 
Then, 


n Zi 

cut^(u)= EE lij fe) cu tG BB (ll) 
i=1 j— 1 


(3.22) 


For i such that ey = 1 and 1 < j < Zi we similarly set I[- (a) = 1 if the j th B half-edge of vertex i is 
matched to a B half-edge of a vertex from the opposite side, and Jh = 0 otherwise. Clearly then 

Zi 

cutc BB (<l) = 4 ( 2 ) > 

{i:<Ti=l} j= 1 

so setting S+ (a, Z) := Yl{i ai= 1 } we have from (3.22) that 

71 Zi Zj_ 

c(a, Z) = ^^P[/ ii ( £ ) = l|Z]- ]T 5>[4(£) = 1|Z] 

i=l j = l {i:o-i = l} 1=1 

c ,„>7)/2 ^5 n (Z)-5+(u,Z) 

— b n {Zi) S n ((7, Zj 

ny — 1 ny — 1 

Considering the extreme values of the RHS of (3.23) yields that for all a € H n , 


(3.23) 


2 v 2 ny 


1 


<c(a, Z) 1-< -S„(Z), 

ny 


1 


from which we deduce that 

E[Z; 


n- 


±i(l - 4-4) +o(n)< inf inf{c( £ ,Z)} 

' ' Ze£n crG^n 

jgr^l] 

< sup sup {c(a, Z)} < n——— + o(n). 


ZeEn ££7 n 

Further, while proving Lemma 3.2 we have shown that 

E[Zi] rry 


(3.24) 


E [cut , 4 ( 0 ;)] = n- 

hence from (3.24) and (3.15) it follows that 


2 (ny — 1 ) 


1 - 


nzih 

2 y / 


sup sup I c(a, Z) — E[cut Act)] I < n- 1 - o(n) < n(logy ) 2 + o(n), 

1 4y 


ze£ n ctSO n 


and since w.h.p. Z € £ n , we conclude that P 2 (n) = o(l). 

Next we consider the graph A = Grr and proceeding in a similar manner we have the decomposition 
(3.17), except for replacing in this case Z in (3.18)-(3.19) by the count Y = (Y\, Y 2 , • • • , Y n ) of number of R 
half-edges at each vertex at the initiation of the second stage. The total number S n (Y) of R half-edges to 
be matched in the second step is less than the initial number S n { Z) of B half-edges. Consequently, if Z € £ n 
then Y € £+ := {y : S n ( y) < nE[Zi] + b n }, and we have again a bound of the type (3.20) on pi(n), just 
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taking here the maximum over y G £+ instead of z G £ n . Further, we repeat the martingale construction 
that resulted with the RHS of (3.21). Specifically, here X o is the sigma-algebra of Y, namely knowing the 
degrees of vertices in Grr, and we expose in the first k edges to have been paired en-route to the 
uniform matching that forms Grr. As before the Doob’s martingale M^ = E[cut ( g RR (CT)|J : fc] has uniformly 
bounded differences, starting at Mq = c(a, Y) and with the same choice of x n the desired bound on p±(n) 
follows upon observing that Mg = cutg RR (a) for all l > 5 n (y), hence as soon as t = nE[Zi] + b n . Turning 
to deal with P 2 (n) in this context, by the same reasoning that led to (3.23) we find that for 5 = 5 n (y) > 2 
and S + (a) = 5+(ct, y), 


c(u,y) = E[cutg RR (u)|Y = y] = 


5+ (a )(5 - 5+(a)) _ 5 2 - (25+ (cr) - sf 


5-1 


4(5-1) 


(3.25) 


While proving Lemma 3.2 we have shown that 


c n := 4E[cutg RR (a)] = n[E[Zi] + 0 7 (1)] + o(n) 

and that c n is constant over a G fl n . As c n > 6x n for n and 7 large, while |5 2 /(5 — 1) — 5| is uniformly 
bounded, we deduce from (3.25) that for any y n < x n , 


{|25+(<r) — S'\ < x n } P){|5 — S'\ < y n } |^){|5 / — c n \ < x n } = 4 > {| 4 c ( ct , y) - c n | < Ax n } . 

Now, w.h.p. S' = 5 n (Z) is in X n := [nE[Zi] — b n ,nE[Zi] + b n \ with | S' — c n \ < x n , and taking then the 
union over a_ G fi n , we get similarly to the derivation of (3.20) that 


P 2 (n) <2 n max max P |25+(u) — s n \ > x n , S > s n — y n \ S' = s r 

n L 

+ max P(5 < s n - y n I S' = s n ) + o(l) =: p 3 (n) + p±{n) + o(l). 

Sn£l n 


(3.26) 


Starting with 2 N = 77.7 half-edges of G\ of whom S' = s n colored B (while all others are colored R), the 
non-negative number S' — S of half-edges in Gbb is stochastically dominated by a Bin(s„, s n /(2N — s n )) 
random variable. For s n G X n the latter Binomial has mean nE[Zi ] 2 /(7 — E[Zi]) + o(n), hence in view 
of (3.15), piin) = o(l) provided y n > 3 n(log 7 ) 2 . For bounding pz{n) we assume w.l.o.g. that cr = 1 iff 
i < n/ 2, with 5+(a) the total number of R half-edges for vertices i < n/ 2, which are matched to B half¬ 
edges by the uniform matching in the first step. Fixing the total number s n of B half-edges in Gi, clearly 
5 +(ct) is stochastically decreasing in the number S' + = Y^=i of B half-edges among vertices i < n/2. 
Thus, it suffices to bound pz{n) in the extreme cases, of S' + = 0, and of S' + = s n . The uniform matching 
of the first step induces a sampling without replacement with 5+(a) denoting the number of marked balls 
when drawing a sample of (random) size 5 G (s n — y n , s n ], uniformly without replacement from an urn 
containing 2N — s n balls, of which either N or IV — s n balls are marked. By stochastic monotonicity, it 
suffices to consider the relevant tails of 5+(<r) — s n / 2 only in the extreme cases of S = s n — y n and 5 = s n . 
As 2 N/n = 7 , s n /n <S 7 and y n <S x n , standard tail bounds for the hyper-geometric distribution [Chv79] 
imply that p^n) = o(l) for 7 sufficiently large, thereby completing the proof. 


3.4. A pairing lemma. We include here, for completeness, the formal proof of the fidelity of the two 
stage pairing procedure (which was used in our preceding arguments). 

Lemma 3.4. Given 21 labeled balls of color R and 2m labeled balls of color B for some m < t, we get a 
uniform random pairing of the R balls by the following two step procedure: 

(a) First match the 2{m +1) balls uniformly at random to obtain some RR. RB and BB pairs. 

(b) Remove all B balls and uniformly re-match the R balls which were left unmatched due to the removal 
of the B balls. 

Proof. We use the notation (2k — 1)!! = (2k — 1)(2 k — 3) • • • 1 and [m\k = m(m — 1) • • • (m — k + 1) and let 
PZ denote the random pairing of the 2 i R balls by our two-stage procedure (which first generated 2s pairs 
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of type RB, (rri — s ) of type BB and (£ — s) of type RR). We then have that for any fixed final pairing P of 
the R balls, 


p] = E 


A [2m] 2s (2(m — s) — 1)!! 
) (2m+ 21- l)!!(2s - 1)!! 


£\(2m)\2 e 


E 22 ‘ 


m + £ 


(2m + 2£)\j^ \m-s,£-s,2sj (2£-l)!!’ 


where the last identity follows upon observing that ^"=o ^ 2s 2s) = • 


□ 


4. From Bisection to Cut: Proof of Theorem 1.6 

Let X±(a) := {i : <jj = ±1} be the partition of [n] induced by a and m(a) := \ \ a i the difference 

in size of its two sides. Note that by the invariance of cute (a) under the symmetry a —>• —cr, it suffices 
to compare the cuts in 5+ = {a G { —1,+1}” : m(a) > 0} to those in Q n . To this end, define the map 
T : 5+ —>• kl n where we flip the spins at the subset V(a) of smallest m(a) indices within l + (a_), thereby 
moving all those indices to l~(T(a_)). Let X(a ), Y(a) and Z(a) count the number of edges from V(a) to 
X“(u), I~(T(a)) and X + (T(g _)) = T + (ct)\U(ct), respectively. Fixing 0 < 5 < 1/4 let 

S* = jer € Sn : m(a) < 7 _<5 n| . (4.1) 

Then, for a* € 5+ such that MaxCut(G n ) = cutG„(<7*) we have 

MaxCut(G„) = cut Gn (T(a*)) + X(a*) - Z(a*) < MCUT(G n ) + Y(a*) - Z(a *). 

Considering the union over a € S*, we get that 


P[MaxCut(G n ) > MCUT(G n ) + A n ] < 2 n max P Y(a) - Z(a) > A n + P [a* (£ S*] 

cr£S* L 


(4.2) 


= : qi(n) + q 2 (n). 

In proving part (a) of Theorem 1.6, we consider w.l.o.g. the Erdos-Renyi random graphs G n ~ Gi(n, —) 
as in Remark 1.3. For fixed a 6 5+ each of the independent variables Y(a) and Z(a) is Bin(lV, j/n) for 
N = m(a)(n/ 2). Upon computing the m.g.f. of Y(a) — Z(a) we get by Markov’s inequality that for any 

e > o, 

A " -\N 


Y(a) - Z(a) > A r 


-26A„ n , 4 7 „• I 2 


< e - L sinlr 

n 

Setting A n = for some if} € (1 — <5,1) fixed and the maximal N = \n 2 7 _<5 for a € S *, we deduce that 

Y(a) ~ Z(a) > A J < -2[9^ /2 - 7 1 " 5 sinh 2 (0)] =: - J . (4.3) 


lim sup n 1 log P 

n—>-oo 

Since 7 / > 1 — 6 we have that 7 1-5 sinh 2 ( 7 - ^/ 2 ) —>• 0, so taking 9 = results with J > 1 for all 7 large 

enough, in which case qi(n) = o(l) (see (4.2)). As for controlling g 2 (n), recall Theorem 1.2 that w.h.p. 
MaxCut(G n ) > MCUT(G n ) > 777 / 4 . Hence, considering the union over g_(/i S* we have that 

< 72 (ti) < 2 n max P (cute (cr) > ^ ) . 
a£S* 4 

For our Erdos-Renyi graphs cutG n (a) ~ R & := Bin(A:(n — k), Y) with k = ^ — m(<r). Taking the maximal 
k * := 777 ~ d for g_(ji S* and computing the relevant m.g.f. yields, similarly to (4.3), that for fi(9) = e e —l, 
f2(9) = e e — 0 — 1 and any 9 > 0, 


lim sup n 1 logP( ) <-[/ 2 (6») - 7 1 28 fi(9) := -J' 


7. 


(4.4) 
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Since 7/2(7 1//2 ) is uniformly bounded while 7 1 2<5 /i(7 1//2 ) = 0 7 (7 1 ^ 2 2S ) diverges (due to our choice of 
6 < 1/4), it follows that for 0 = 7 -1 / 2 and 7 large enough, J' > 1 hence q^in) = o(l), thereby completing 
the proof. 

The Erdos-Renyi nature of the graph G n is only used for deriving the large deviation bounds (4.3) and 
(4.4). While slightly more complicated, similar computations apply also for G Reg (n,7). Indeed, in this case 
Y(a) — Z(a) corresponds to the sum of spins in a random sample of size 7777(a) taken without replacement 
from a balanced population of 777. spins (so by standard tail estimates for the hyper-geometric law, here 
too the LHS of (4.3) is at most —1 for any 7 large enough). Similarly, now Ry* counts the pairs formed 
by uniform matching of 777 items, between a fixed set of 7 k* items and its complement (so by arguments 
similar to those we used when proving Lemma 3.3, the LHS of (4.4) is again at most —1 for large 7). With 
the rest of the proof unchanged, we omit its details. 
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